Deriving Ground Truth Labels for Regression Problems Using Annotator Precision

نویسندگان

چکیده

When training machine learning models with practical applications, a quality ground truth dataset is critical. Unlike in classification problems, there currently no effective method for determining single value or landmark from set of annotations regression problems. We propose novel deriving labels problems that considers the performance and precision individual annotators when identifying each label separately. In contrast to commonly accepted computing global mean, our does not assume annotator be equally capable completing specified task, but rather ensures higher-performing have greater contribution final result. The selection described within this paper provides means improving input data model development by removing lower-quality labels. study, we objectively demonstrate improved applying simulated where canonical position can known, as well sample collected crowd-sourced

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Intra-Annotator Rating Consistency Through Copeland's Method for Estimation of Ground Truth Labels in Couples' Therapy

Behavioral and mental health research and its clinical applications widely rely on quantifying human behavioral expressions. This often requires human-derived behavioral annotations, which tend to be noisy, especially when the psychological objects of interest are latent and subjective in nature. This paper focuses on exploiting multiple human annotations toward improving reliability of the ens...

متن کامل

Precision and Recall Without Ground Truth

In this paper we present a way to use precision and recall measures in total absence of ground truth. 1 Precision and Recall 1.1 General Definitions and Notation Precision Pr and Recall Rc (and often associated F-measure or ROC curves) are standard metrics expressing the quality of Information Retrieval methods [8]. They are usually expressed with respect to a query q (or averaged over a series...

متن کامل

Deriving Labels

This paper builds on Hornstein’s (2005) proposal that Merge (Chomsky, 1995) can be split up into the more basic operations Concatenate and Label. This opens up the possibility that Concatenate applies without Label to generate flat structures, an option which Hornstein & Nunes (2005) explore for encoding adjuncts. The central claim is that Label is not needed as a syntactic primitive if a long ...

متن کامل

Using objective ground-truth labels created by multiple annotators for improved video classification: A comparative study

We address the problem of predicting category labels for unlabeled videos in a large video dataset by using a ground-truth set of objectively labeled videos that we have created. Large video databases like YouTube require that a user uploading a new video assign to it a category label from a prescribed set of labels. Such category labeling is likely to be corrupted by the subjective biases of t...

متن کامل

A Ground Truth Inference Model for Ordinal Crowd-Sourced Labels Using Hard Assignment Expectation Maximization

In this paper we propose an iterative approach for inferring a ground truth value of an item from judgments collected form online workers. The method is specifically designed for cases in which the collected labels are ordinal. Our algorithm works by iteratively solving a hard-assignment EM model and later calculating one final expected value after the convergence of the EM procedure.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13169130